Comparisons of Different Feature Sets for Predicting Carbohydrate-Binding Proteins From Amino Acid Sequences Using Support Vector Machine
نویسندگان
چکیده
under my guidance and supervision and be accepted in partial fulfillment of the requirements for the degree of Master of Computer Science & Engineering. The research results presented in the thesis have not been included in any other paper submitted for any degree to any other University or Institute. It is understood that by this approval the undersigned do not necessarily endorse or approve any statement made, opinion expressed or conclusion drawn therein but approve the thesis only for the purpose for which it has been submitted. All information have been obtained and presented in accordance with academic rules and ethical conduct. I also declare that, as required by these rules and conduct, I have fully cited and referenced all materials and results that are not original to this work. Acknowledgement The main objective of this project is to predict the carbohydrate-binding proteins using efficient machine learning methods. During the period I have come to know various databases of proteins, tools and techniques for classification and related works. I would like to express my gratitude to Prof. Mita Nasipuri, for her valuable guidance, advice and supervision that have served as a pillar of strength and confidence throughout my M.E Thesis work. I am privileged to witness her enthusiastic and dedicated interest towards advanced research that motivates me and enriches my growth as a student and a researcher. I am indebted to her for her unflinching support and the innovative ideas and extraordinary experiences that she have shared with me during the course of this research. It is an honor for me to thank heartily to Prof Mahantapas Kundu of the Computer science and engineering Department, Jadavpur University whose truly scientific intuition and crucial contribution have paved the way towards successful execution of this research work. for his unsolicited help for any type of software related problems. I would also like to thank Smt. Piyali Chatterjee for her effective advices, during data collection process. I would like to extend my appreciation to my friends and seniors without whose support and cooperation, this thesis would never been possible. Last but not the least I would like to thank all my family members for giving me constant encouragement and mental support during my project work.
منابع مشابه
Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines
Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs). We first clarified the definition of carbohydrate-binding proteins and then constructed positive and neg...
متن کاملDevelopment of Prediction Method for GPCR G-protein Coupling Selectivity Using Amino Acid Properties
We describe a novel method for predicting G-protein coupled receptor (GPCR) G-protein coupling selectivity using amino acid properties of specific residues in GPCR sequences. We have evaluated various amino acid properties obtained with experimental or theoretical studies. The GPCRs having reliable G-protein binding information were collected from Guide to Receptors and Channels (GRAC) and gpDB...
متن کاملIn silico comparison of Iranian HIV -1 envelop glycoprotein with five nearby countries
HIV-1 envelope (env) glycoprotein mediates an important role in entry of the virus into the susceptible target cells. As env glycoprotein of HIV-1 is highly variable in the different geographical regions, in the present study, different properties of this protein in Iran are compared with five nearby countries. The sequences of HIV-1 env glycoproteins of Iran, Afghanistan, Russia, Turkey, Pakis...
متن کاملProtein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملPredicting the Phosphorylation Sites Using Hidden Markov Models and Machine Learning Methods
Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for which how to efficiently extract the most predictive features from amino acid sequences for modeling is still challenging. Although both the distributed encoding method and the bio-basis function method work well, they still have some limits in use. The distributed encoding method is unable to cod...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012